Automatic transcription of intonation using an identified prosodic alphabet

Author

  • Stéphanie de Tournemire
Abstract

A solution is proposed for rapidly adapting prosodic models to a new voice or a new application. First, a prosodic alphabet supported by linguistic knowledge is identified at the acoustic level. Observing how prosodic events are realised in the acoustic corpus allows classes of breaks, F0 shapes and accents to be constructed and automatic transcription rules to be written. The transcribed corpus is then used to estimate the parameters of a prosodic model for French. The good F0 contours and durations generated with the prosodic model confirm that the identified alphabets are suited to describing prosodic phenomena. Finally, the prosodic model is integrated into the CNET standard French Text-to-Speech Synthesis system. Naïve listeners judge the quality of the generated prosody to be equivalent to that of the handcrafted system. This result confirms the appropriateness of the alphabets as prosodic descriptors.

Similar resources

Levels of representation and levels of analysis for the description of intonation systems

It is argued that a satisfactory global theory of intonation will require four levels of analysis: (i) physical (acoustic, physiological), (ii) phonetic, (iii) surface phonological and (iv) deep phonological. The theoretical and cognitive status of each level is discussed and specific proposals are made for a model respecting such an overall architecture as well as a condition of interpretabilit...

Speech Analysis for Automatic Evaluation of Shadowing

This paper presents acoustic analysis for the purpose of automatic evaluation of shadowing speech. We use self-checked scores of understanding, manual prosodic scores, and TOEIC scores as reference scores of learners’ shadowing speech, and compare these scores with automatic scores based on acoustic features that can reflect phoneme intelligibility and prosodic fluency in terms of intonation, an...

Unit Selection Speech Synthesis Using Phonetic-Prosodic Description of Speech Databases

This paper describes an approach to speech synthesis based on using speech databases at different stages of the TTS process. Speech database units are phones in different segmental and prosodic contexts. Pitch-synchronous segmentation and labeling of databases allows both segmental and prosodic information to be stored. Phonetic-prosodic annotations of speech databases are involved in off-line training ...

SLAM: Automatic Stylization and Labelling of Speech Melody

This paper presents SLAM: a simple method for the automatic Stylization and LAbelling of speech Melody. Its main contributions over existing methods are: the alphabet of melodic contours is fully data-driven, an explicit time-frequency representation is used to derive complex melodic contours, and melodic contours can be determined over arbitrary prosodic/syntactic units. Additionally, the s...

Automatic recognition of intonation from F0 contours using the rise/fall/connection model

This paper describes an automatic system for labelling intonational tune information based on the Rise/Fall/Connection model of intonation. The system is powerful in that it presupposes no prosodic knowledge of the utterance it is recognizing, and is capable of labelling all the intonational tune effects of English.

Publication date: 1998